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□ 1: XPJ53478. hypothetical prot...[gi:20909630] 



BLink, Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 
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KEYWORDS 

SOURCE 
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REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



LOC217730 197 aa linear ROD 17-MAY-2002 

hypothetical protein XP_153478 [Mus musculus] . 

XP_153478 

XP_15347 8 . 1 GI : 20909630 
REFSEQ: accession XM 153478.1 

Mus musculus (house mouse) 
Mus musculus 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Rodent ia; Sciurognathi ; Muridae; Murinae; Mus. 
1 (residues 1 to 197) 
NCBI Annotation Project. 
Direct Submission 

Submitted (15-MAY-2 002 ) National Center for Biotechnology 
Information, NIH, Bethesda, MD 20894, USA 

GENOME ANNOTATION REFSEQ : This model reference sequence was 
predicted from NCBI contig NW 000053 by automated computational 
analysis using gene prediction method: GenomeScan. 
Also see: 

Documentation of NCBI's Annotation Process 
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ORIGIN 



// 



Location/Qualif iers 
1. .197 

/organism= "Mus musculus" 
/strain="C57BL/6J" 
/ db_xre f = " t axon : 1 0 0 9 0 " 
/chromosome="12 " 
1. .197 

/product^ "hypothetical protein XP_153478" 
1. .197 

/gene="LOC217730 M 
/coded_by= "XM_153478 . 1 : 1 . . 594 " 
/db_xref="InterimID: 217730" 

1 metasaaprr vwpfhfaf la gflshpeths pgepgplprp pacpsvpfmw vaqwirlifv 

61 tldvlgsqpm kcpghvalhq dpgpwrpsg shgnalpldc lmgptshwnv skippfrqps 

121 kppltdgssq fqqqenpfra cpdfqlgkqm atiakarhpp syilfphpsl snsnawtnl 

181 rqnlssdssv sissqss 



CDS 



Disclaimer | Write to the Help Desk 
NCBI I NLM I NIH 



1 of 1 



5/20/03 3:16 PM 



NCBI Blast 



http://www.ncbi.nlm.nih.govMa^ 



is^^ 



<> NCBI 

Nucleotide 




protein-protein - 



Protein 



Translations 



Retrieve results for an 
RID 



Search 



pykwpfhf af fhqilatstiamshcrptvykqtsii 






pykwpfhf af fhqilatiamshcrptvykqtsii 












-1 


1- 





Set i 
subsequence rom * ' 



Choose 
database 



nr 



Do CD-Search □ 



Now: 



J To: 



or 



Options for advanced blasting 



Limit by entrez 
. _ query 

Composition-based 
statistics 



□ 



or select from: (none) 



Choose filter □ Low complexity Q Mask for lookup table only Q Mask lower case 



Expect [20000 | 



Word Size 2 M 



Matrix PAM30 



Gap Costs 



Existence: 9 Extension: 1 



PSSM 



Other advanced 
PHI pattern 



1 of 2 



5/20/03 3:08 PM 



NCBI Sequence Viewer 



wysi wyg://40/http://www. ncbi.nlm.nih.go...Protein&list_uids=20909630&dopt=GenPept 





0000000000 oJsfe "5 e V^Jw! A « 



PubMed 


Nucleotide 


Protein 


Genome 


Structure 


PMC 


Taxonomy 


OMIM 


Books 




□ l: XPJ53478. hypothetical prot...[gi:20909630] 



BLink, Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



LOC217730 197 aa linear 

hypothetical protein XP__153478 [Mus musculus] . 
XP_153478 

XP_153478 . 1 GI :20909630 
REFSEQ: accession XM 153478.1 



ROD 17-MAY-2002 



Mus musculus (house mouse) 
Mus musculus 



Craniata ; Vertebrata ; Euteleos tomi ; 
Sciurognathi; Muridae; Murinae; Mus. 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

COMMENT 



ORIGIN 



// 



Eukaryota; Metazoa; Chordata; 
Mammalia; Eutheria; Rodent ia; 
1 (residues 1 to 197) 
NCBI Annotation Project. 
Direct Submission 

Submitted (15-MAY-2002) National Center for Biotechnology 
Information, NIH, Bethesda, MD 20894, USA 

GENOME ANNOTATION REFSEQ : This model reference sequence was 
predicted from NCBI contig NW 000053 by automated computational 
analysis using gene prediction method: GenomeScan. 
Also see: 

Documentation of NCBI • s Annotation Process 

Location/Qualifiers 
1. .197 

/organism= M Mus musculus" 
/strain="C57BL/6J" 
/db_xref="taxon: 10090" 
/ chromosome = " 12 " 

1..197 

/product= "hypothetical protein XP_153478" 
1. .197 

/gene="LOC217730" 
/coded__by="XM_153478 . 1 : 1 . . 594 " 
/db_xref = " InterimID : 217730 " 

1 metasaaprr vwpfhfafla gflshpeths pgepgplprp pacpsvpfmw vaqwirlifv 

61 tldvlgsqpm kcpghvalhq dpgpwrpsg shgnalpldc lmgptshwnv skippfrqps 

121 kppltdgssq fqqqenpfra cpdfqlgkqm atiakarhpp syilfphpsl snsnawtnl 

181 rqnlssdssv sissqss 



FEATURES 

source 



Protein 



CDS 



Disclaimer | Write to the Help Desk 
NCBI I NLM I NIH 



vl;\v }):2^:i; 



1 of 1 



5/20/03 3:16 PM 



RU> 1 0534569 1 5-026 125-1387, 



http://www.ncbi.nlm.nih.gov/blast/Blast.cgi 



results of 



BLASTP 2.2.6 [Apr-09-2003] 

Reference ; 

Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997) , 
"Gapped BLAST and PSI -BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

RID: 1053456915-026125-1387 

Query= 

(34 letters) 



Database: All non-redundant GenBank CDS 
translations+PDB+SwissProt+PIR+PRF 

1,435,412 sequences; 461,393,641 total letters 



If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 

No significant similarity found. For reasons why, click here. 



Database: All non-redundant GenBank CDS 
trans lations+PDB+SwissProt+PIR+PRF 

Posted date: May 20, 2003 1:52 AM 
Number of letters in database: 461,393,641 
Number of sequences in database: 1,435,412 

Lambda K H 

0.335 0.141 0 .505 

Gapped 

Lambda K H 

0.267 0.0410 0.140 



Matrix: BL0SUM62 

Gap Penalties: Existence: 11, Extension: 1 
Number of Hits to DB : 28,353,653 
Sequences : 1435412 
extensions : 452865 
successful extensions: 1347 
sequences better than 10.0: 0 
HSP's better than 10.0 without gapping: 0 
HSP's successfully gapped in prelim test: 0 
HSP's that attempted gapping in prelim test: 
HSP's gapped (non-prelim) : 0 
query: 34 

database: 461,393,641 
HSP length: 10 
length of query: 24 
length of database: 447,039,521 
search space: 10728948504 
search space used: 10728948504 



Number of 
Number of 
Number of 
Number of 
Number of 
Number of 
Number of 
Number of 
length of 
length of 
effective 
effective 
effective 
effective 
effective 
T: 11 
A: 40 

XI: 15 (7 
X2: 38 (14 



1347 



3 bits) 
6 bits) 
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http://www.ncbi.nlm.nih.gov/blast/Blast.cgi \ 



BLAST? 2.2.6 [Apr-09-2003] 

Reference : 

Altschul, Stephen F . , Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997) , 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search 
programs", Nucleic Acids Res. 25:3389-3402. 

RID: 1053457733-016164-25584 

Query= 

(36 letters) 



Database: All non-redundant GenBank CDS 
trans lations+PDB+SwissProt+PIR+PRF 

1,435,412 sequences; 461,393,641 total letters 



If you have any problems or questions with the results of this search 
please refer to the BLAST FAQs 

Taxonomy reports 
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RID=H 053457733-016164-25584, 



http://www.ncbi. nlmnih.gov/blast/B last, cgi 



Sequences producing significant alignments: 



qi [21287767 



gi|20909630 
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21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



27695175 



21242454 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



21 



28900823 



30177923 



21231138 



23137395 



21245069 



24374681 



18398283 



10946860 



18462766 



27672343 



28871387 



23128861 



21293138 



9801567 



9801568 



9801569 



ref 



XP 153478.1 



ref 



XP 223369.1 



gb|EAA00088.l| ENSANGP00000017962 [Anopheles ga. 

hypothetical protein XP_153478. 
similar to low density lipopro. 
MFS transporter [Xanthomonas a. 
3-hydroxy-3-methylglutaryl CoA. 
gb[EAA45104 .1 | ENSANGP0000002 32 94 [Anopheles ga. 
NP 637055. l| MFS transporter [Xanthomonas c. 
ZP 00119101.1 | hypothetical protein [Cytoph. 



ref 



ref 



NP 642036.1 



NP 800478.1 



ref 



ref 



ref 



ref 



ref 



ref 



NP 644651.1 



NP 718724.1 



NP 565400.1 



NP 067441.1 



conserved hypothetical protein. 
DnaJ domain protein [Shewanell. 
F-box protein family, AtFBLlO . 
TATA box binding protein (Tbp) . 
gb 1 AAL72538 . 1 | hypothetical protein [Shigella f. 

similar to hypothetical protei. 

hypothetical protein [Pseudomo. 

ZP 00110699. l| hypothetical protein [Nostoc . 



ref 



ref 



ref 



XP 221755.1 



NP 794006.1 



gb|EAA05283.l| 



cjb 



_b 



gb 



AAF97943.2 



AAF97944 .2 



AAF97945 .2 



28564982 



15078968 



15641102 



19114288 



9801570 



9801564 



ref 



ref 



ref 



2__ 



2^ 



NP 149719.1 



NP 230734.1 



NP 593376.1 



AAF97946.2 



AAF97940 .2 



ENSANGP00000008298 [Anopheles ga. 
cytochrome P450 CYP6N4v3 [Aedes a. 
cytochrome P450 CYP6N4v4 [Aedes a. 

, _ cytochrome P450 CYP6N4v5 [Aedes a. 

gb | AA038684 . 1 | truncated PDR5 [Saccharomyces kl . 

256R [Invertebrate iridescent . 
periplasmic binding protein-re. 
hypothetical protein [Schizosa. 
cytochrome P450 CYP6N4v6 [Aedes a. 

cytochrome P450 CYP6N4vl [Aedes a. 

729930 | sp | P16270 | LECN PEA NONSEED LECTIN PRECURSOR >gi | 3 . 

NP 842957. l[ hypothetical protein [Bacillus. 
CAD31853 . 1 1 putative chemosensory receptor . 

conserved hypothetical protein. 
Beta-1 , 3 -N-acetylglucosaminylt . 
probable glycosyl transferase . 
Hypo the t i c al protein XP_2 113 09 . 
chromosomal region maintenance . 

hypothetical protein XP_242294. 

S20988 lectin - garden pea >gi | 169172 | gb | AAA. 
T29765 hypothetical protein ZC581.4 - Caeno. 
NP654351 . 1 | hypothetical protein predicted. 
AAD45549 . 1 |U70376 14 SpcE [Streptomyces netro. 



30260580 



22293495 



26987557 



30023459 



17230565 



27500354 



13812138 



27706584 



ref 



emb 



ref 



ref 



ref 



ref 



ref 



ref 



100052 |pir 



7510844 



PIT 



21398366 ref 



NP 742982.1 



NP 835090.1 



NP 487113.1 



XP 211309.1 



NP 113265.1 



XP 242294.1 



5579428 |gb 



630666 |pir 



28917410 



23612355 



15802965 



16130359 



17297993 



19113394 



13476001 



30020491 



22298512 



21292670 



23059042 



15964592 



28901509 



17553568 



15240890 



15889786 



23307111 



30176975 



18860525 



27753111 



_b 



ref 



S40748 hypothetical protein F54C8.6 - Caenor. 
EAA27114.ll hypothetical protein [Neurospora. 



ref 



ref 



dbj 



ref 



ref 



ref 



ref 



NP 703935.1 



NP 288995.1 



NP 416929.1 



NP 596602.1 



transportin [Plasmodium falcip. 
Z3699 gene product [Escherichi. 
orf, hypothetical protein [Esc. 
BAB78507.l| transportin [Plasmodium falcipa. 

SNF2 family dna repair protein. 
ABC transporter, substrate bin. 
hypothetical protein [Bacillus. 
ORF_ID: tlr0969-probable membra. 
gb|EAA04815.l| ENSANGP000000 19532 [Anopheles ga. 
ZP 00084044.1 | hypothetical protein [Pseudo. 



NP 107571.1 



NP 832122.1 



NP 681759.1 



ref 



ref 



ref 



ref 



ref 



ref 



dbj 



NP 384945.1 



NP 801164.1 



NP 499077.1 



NP 195730.1 



NP 355467.1 



PROBABLE RIBONUCLEASE HI I PROT. 
putative ferrichrome ABC trans. 
Predicted CDS, putative membra, 
expressed protein [Arabidopsis . 
AGR_C_4580p [Agrobacterium turn. 
BAC16541 . 1 | ferric vibrioferrin transport s. 
gb[EAA08960.2 [ ENSANGP0000001963 9 [Anopheles ga. 
NP 573365. l| CG7876-PA [Drosophila melanoga. 
CAA84735 .2 | C. elegans SRG-11 protein (corr. 



ref 



emb 



Score E 
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RID= 1 053457733-0 1 6 1 64-255 84, 



http://www.ncbi.nlranih.gov/blast/Blast.cgi 



2i 

ai 
ai 

Si 

si 

a* 
ai 



21 
2l 
22, 

ai. 

22, 
2l 
2l 
2i 
21. 
21 
2l 
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21 
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ai 
2i 
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25342678 



30157001 



27707330 



15611696 



15217347 



£11 



ref 



ref 



ref 



3b 



630612 |pir 



16944534 



30179202 



27685447 



| B88555 protein F54C8.6 [imported] - Caenor. 



XP 291865.2 



XP 242328.1 



.NP 223347 1 
AAK92 685 . 1 | ACQ 90 7 14 18 



S44629 



similar to olfactory receptor . 
hypothetical protein XP_242328. 
putative TYPE II DNA MODIFICAT. 

putative retrotranspo . 

F22B7.10 protein - Caenorhabditis ele. 

emb | CAD11 3 3 0 . 1 [ hypothetical protein [Neurospor. 
gb|EAA45649.l| ENSANGP00000024832 [Anopheles ga . 

serine (or cysteine) proteinas . 
This CDS feature is included to s. 
24666563 [ref |NP 649079.2] CG6836-PA [Drosophila melanoga. 
17861760 [gb[ AAL3 93 57 . 1 | GH26215p [Drosophila melanogaster] 
121111 | sp | P07092 | GDN RAT Glia derived nexin precursor (G. 

gblEAA45650.l| ~ ENSANGP0 0000025224 [Anopheles ga . 
ref|NP 186830. l| expressed protein [Arabidopsis . 

[Anopheles gambiae 



ref|XP 217453. 1| 



14 773 86 |gb|AAC3733 8.l| 



30179203 



15232200 



13660729 



28515296 



26336525 



30840221 



17552842 



30179201 



20837204 



29349732 



15131119 



23273294 



gblAAK32 960.l| 



ref 



dbj 



emb 



ref 



XP 288452.1 



BAC3194 5.1 



CAD61285.1 



NP 498909.1 



gb|EAA45648.l| 



ref 



ref 



emb 



XP 158536.1 



NP 813235.1 



CAC48190.ll 



gb|AAH34950.l| 



7019547 | ref |NP 037386. lj 



23058842 



26991318 



23042124 



15791673 



26655514 



14276199 



14276231 



22957373 



17543364 



15601658 



27718495 



21347201 



15800733 



17456139 



ref 



ref 



ref 



ref 



3b 



3b 



3b 



3b 



ZP 00083865. 1| 



NP 746743.1 



ZP 00073532 



NP 281496.1 



AAN85858.1 



AAK58140.1 



AAK58156.1 



ZP 00005078 



ref 



ref 



ref 



NP 501353.1 



NP 233289.1 



XP 216902.1 



gb|AAL50224.l| 



ref NP 286747.1 



reflXP 051362.2 



cytochrome P450 

hypothetical protein XP_288452. 
unnamed protein product [Mus mu. 
fertility restorer [Raphanus sa. 
DumPY : shorter than wild-type. 
ENSANGP00000025060 [Anopheles ga . 
hypothetical protein XP__158536. 
conserved hypothetical protein. 
NADH dehydrogenase subunit 2 [L. 
TANK-binding kinase 1 [Homo sapi . 

TANK-binding kinase 1; NF-kB-ac. 
l| hypothetical protein [Pseudo. 
_ conserved hypothetical protein. 
. 1 1 hypothetical protein [Tricho. 

hypothetical protein Cj 0305c [. 
NADH dehydrogenase subunit II [M. 
NADH dehydrogenase subunit II [M. 
NADH dehydrogenase subunit II [B. 
1 1 hypothetical protein [Rhodoba. 
Putative. nuclear protein, with, 
permease [Vibrio cholerae] >gi . 
similar to TANK-binding kinase. 
NADH dehydrogenase subunit II [M. 
unknown [Escherichia coli 0157. 
similar to Transmembrane 6 sup. 



Alignments 
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Get selected sequences 



Select all 



Deselect all 



□ > gi [21287767 jgb | EAA00088. 1 ] ENSANGP00000017962 [Anopheles gambiae str. PEST] 

Length =314 

Score = 31.2 bits (66), Expect = 1.6 

Identities = 19/47 (40%), Positives = 23/47 (48%), Gaps = 18/47 (38%) 

Query: 4 WPFHFAFFH Q ILATSTI- -AMSHCRPTVYKQTSII 36 

WP+H A FH Q IL +TI A SHC VY Q+ +1 
Sbjct: 52 WPWHVALFHRKDAQYEYACGGS ILDENTILTA- SHC VYTQSGVI 94 

□ > gi|20909630|ref [XP 153478. l| hypothetical protein XP_153478 [Mus musculus] 

Length = 197 

Score = 29.9 bits (63), Expect = 3.8 
Identities = 7/7 (100%), Positives = 7/7 (100%) 
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RID= 1 053457733-0 1 6 1 64-25584, 



http://www.ncbi.nlm.nih.gov/blast/Blast.cgi 



Query: 4 WPFHFAF 10 

WPFHFAF 
Sbjct: 12 WPFHFAF 18 

D > gi | 2 7695175 1 ref | XP 22 3 3 69 . 1 1 similar to low density lipoprotein receptor-relat 

low density lipoprotein-related protein 4; Low Density 
Lipoprotein Receptor Related Protein 4; corin [Mus 
musculus] [Rattus norvegicus] 
Length = 276 

Score = 28.2 bits (59), Expect = 12 
Identities = 7/10 (70%) , Positives = 9/10 (90%) 

Query: 6 FHFAFFHQIL 15 

FHF+ FHQ+L 
Sbjct: 2 FHFSLFHQVL 11 

□ > gi | 21242454 j ref [NP 642036. l| MFS transporter [Xanthomonas axonopodis pv. citri 
gi 1 21107 8 99 | gb | AAM3 6572 . 1 [ MFS transporter [Xanthomonas axonopodis pv. citri str. 

Length = 474 

Score =28.2 bits (59), Expect = 12 
Identities = 10/17 (58%) , Positives = 12/17 (70%) 

Query: 6 FHF AFFHQ I LATS T I AM 22 

F FAFF Q LA T+A+ 
Sbjct: 167 FAFAFFVQFLAVPTVAL 183 

□ > gi [ 28900823 | ref |NP 800478. l| 3 -hydroxy-3 -methylglutaryl CoA reductase [Vibrio p 

RIMD 2210633] 

gi 1 28809269 j dbj |BAC62311.l| 3 -hydroxy- 3 -methylglutaryl CoA reductase [Vibrio para 
Length =42 0 

Score =27.8 bits (58), Expect = 16 

Identities = 10/18 (55%), Positives = 11/18 (61%), Gaps = 5/18 (27%) 

Query: 11 FHQILATSTIAMSHCRPT 2 8 

FH I LATS I +PT 
Sbjct: 225 FHHILATSPI KPT 237 

□ > gi | 3 0177923 [gb|EAA45104 . 1 1 ENSANGP00000 0232 94 [Anopheles gambiae str. PEST] 

Length = 2 32 

Score =27.8 bits (58), Expect = 16 

Identities = 15/36 (41%) , Positives = 17/36 (47%) , Gaps = 15/36 (41%) 

Query: 4 WPFHFAFFHQ ILATSTI - - AMSHC 25 

WP+H A FHQ IL +TI A SHC 

Sbjct: 1 WPWHAAIFHQKDKHKEYACGGSILDETTILTA-SHC 35 

□> gi | 21231138 1 ref |NP 637055. l| MFS transporter [Xanthomonas campestris pv. campes 
33913] 

gi | 21112 775 | gb | AAM4 0979 . 1 [ MFS transporter [Xanthomonas campestris pv. campestris 
33913] 
Length = 468 

Score =27.8 bits (58), Expect = 16 
Identities = 10/18 (55%) , Positives = 13/18 (72%) 

Query: 6 FHFAFFHQILATSTIAMS 23 

F FAFF Q LA + +A+S 
Sbjct: 166 FAFAFFLQFLAVPSVALS 183 
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□> gi|23137395|ref | ZP 00119101. l| hypothetical protein [Cytophaga hutchinsonii] 
Length =414 

Score = 27.4 bits (57), Expect = 22 

Identities = 13/35 (37%) , Positives = 16/35 (45%) , Gaps = 19/35 (54%) 

Query: 7 HF AFF HQIL ATS T IAN 22 

HF AFF H++L +TSTIAM 

Sbjct: 264 HFVIGAFFGAMLLSHELLGKENFHIVEKSTSTIAM 298 

□ > gi |21245069|ref [ NP 644651. l| conserved hypothetical protein [Xanthomonas axonop 
str. 306] 

gi | 211108 02 [gb|AAM39187 . 1 [ conserved hypothetical protein [Xanthomonas axonopodis 
str. 306] 
Length =12 9 

Score = 27.4 bits (57), Expect = 22 
Identities = 7/9 (77%), Positives = 7/9 (77%) 



Query: 3 KWPFHFAFF 11 

KW F FAFF 
Sbjct: 12 KWQFRFAFF 2 0 



□ > gi [24374681 1 ref |NP 718724. l[ DnaJ domain protein [Shewanella oneidensis MR-1] 
gi |2434 932 6 | gb | AAN56 168.1 | AE015754 2 DnaJ domain protein [Shewanella oneidensis M 
Length =4 02 

Score = 26.9 bits (56), Expect = 30 

Identities = 9/11 (81%), Positives = 9/11 (81%), Gaps = 1/11 (9%) 



Query: 6 FHFAFFHQILA 16 

FH AFF QILA 
Sbjct: 170 FH-AFFEQILA 179 



□ > gi | 18398283 [ref [NP 565400.1 



34 



21 



25372906 



13605809 



22137200 



pir | J.B84547 



3b 



AAK32890 . 1 



AAM91445 . 1 



F-box protein family, AtFBLlO [Arabidopsis thalian 
hypothetical protein At2gl7020 [imported] - Arabidopsis t 
AF367303 1 At2gl7020 [Arabidopsis thaliana] 



Length = 656 



At2gl7020/At2gl7020 [Arabidopsis thaliana] 



Score = 26.9 bits (56), Expect = 30 

Identities = 9/14 (64%) , Positives = 12/14 (85%) , Gaps = 1/14 (7%) 



Query: 11 FHQILATSTIAMSH 24 

FH ILAT T+++SH 
Sbjct: 370 FHDILAT-TLSLSH 382 



□ > gi|l0946860|ref |NP 067441. l| TATA box binding protein (Tbp) -associated factor, 

A [Mus musculus] 
gi | 18422 04 |emb|CAA71091. 1 | TAFI48 protein [Mus musculus] 

Length = 453 

Score = 26.9 bits (56), Expect = 30 

Identities = 7/9 (77%), Positives = 8/9 (88%), Gaps = 1/9 (11%) 

Query: 4 WP-FHFAFF 11 

WP FHF+FF 
Sbjct: 380 WPAFHFSFF 3 88 

□ > gi 1 184 62 766 [gb[ AAL7253 8. 1 1 hypothetical protein [Shigella flexneri 2a] 

Length = 13 9 



Score = 26.9 bits (56), Expect = 30 
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Identities = 11/17 (64%), Positives = 13/17 (76%), Gaps = 2/17 (11%) 

Query: 14 ILATSTIAMSHCRPTVY 3 0 

ILA+S I+M H R TVY 
Sbjct: 87 ILASSKISMLH-R-TVY 101 

□ > gi | 27672343 | ref [XP 221755. l| similar to hypothetical protein MGC2408 [Homo sap 

norvegicus] 
Length = 2 56 

Score =26.9 bits (56), Expect = 30 

Identities = 10/16 (62%) , Positives = 12/16 (75%) , Gaps = 2/16 (12%) 

Query: 1 PYKWPF-HFAFFHQIL 15 

PY+W F FAFFH +L 
Sbjct: 130 PYEW - FGDFAFFHALL 144 

□> gi | 28871387 | ref |NP 794006. l| hypothetical protein [Pseudomonas syringae pv. torn 
gi | 2 885463 8 [ gb | AAO577 01 . 1 | hypothetical protein [Pseudomonas syringae pv. tomato 
Length = 1111 

Score =26.9 bits (56), Expect = 30 
Identities = 7/8 (87%), Positives = 8/8 (100%) 

Query: 8 FAFFHQIL 15 

FAFFHQ+L 
Sbjct: 728 FAFFHQLL 735 

□ > gi | 23128861 1 ref | ZP 00110699. l[ hypothetical protein [Nostoc punctiforme] 

Length = 2181 

Score =26.9 bits (56), Expect = 30 
Identities = 8/12 (66%) , Positives = 8/12 (66%) 

Query: 10 FFHQILATSTIA 21 

FFH I A TIA 
Sbjct: 595 FFHHIFANPTIA 606 

□ > gi | 212 9313 8 [gb | EAA05283 . 1 | ENSANGP00000008298 [Anopheles gambiae str. PEST] 

Length = 602 

Score = 26.5 bits (55), Expect = 40 
Identities = 8/12 (66%) , Positives = 9/12 (75%) 

Query: 15 LATSTIAMSHCR 2 6 

L TSTI + HCR 
Sbjct: 580 L VTS T I VLEHCR 591 

□ > gi | 9801567 |gb|AAF97943 .2 | cytochrome P450 CYP6N4v3 [Aedes albopictus] 

Length =216 

Score = 26.5 bits (55), Expect = 40 

Identities = 11/21 (52%), Positives = 13/21 (61%), Gaps = 6/21 (28%) 

Query: 8 FAFFHQILA TSTIAMSHC 25 

F FF LA TS+ AMS+C 
Sbjct: 17 FVFF LAGFETSSTAMSYC 34 

□ > gi | 98 01568 |gb|AAF97944 .2 | cytochrome P450 CYP6N4v4 [Aedes albopictus] 

Length =216 

Score = 26.5 bits (55), Expect = 40 

Identities = 11/21 (52%), Positives = 13/21 (61%), Gaps = 6/21 (28%) 



6 of 13 



5/20/03 3:10 PM 



1110=1053457733-016164-25584, http://www.ncbi.nlm.nih.gov^ast/Blastxgi 

CI' 

Query: 8 FAFFHQILA TSTIAMSHC 2 5 

F FF LA TS+ AMS+C 
Sbjct: 17 FVFF LAGFETSSTAMSYC 34 

□ > gi | 9801569 [gb|AAF97945 .2 | cytochrome P450 CYP6N4v5 [Aedes albopictus] 

Length = 216 

Score = 26.5 bits (55) , Expect = 40 

Identities = 11/21 (52%), Positives = 13/21 (61%), Gaps = 6/21 (28%) 

Query: 8 FAFFHQILA- - -TSTIAMSHC 25 

F FF LA TS+ AMS+C 
Sbjct: 17 FVFF ---LAGFETSSTAMSYC 34 

E3> gi | 2 8564 982 [gb| AAQ3 8684 . 1 [ truncated PDR5 [Saccharomyces kluyveri] 
Length = 1023 

Score = 26.5 bits (55), Expect = 40 

Identities = 12/20 (60%), Positives = 13/20 (65%), Gaps = 4/20 (20%) 

Query: 7 HFAFFHQILA- -TSTIAMSH 24 

HF FF+ L TST AMSH 
Sbjct: 639 HF-FFY-FLVSVTSTFAMSH 656 

□ >gi| 15078968 | ref | NP 14 9719 . 1 1 256R [Invertebrate iridescent virus 6] 
gi 1 15042 3 37 [gb | AAK82 117 . 1 | AF3 03 741 256 256R [Chilo iridescent virus] 

Length = 78 

Score = 26.5 bits (55), Expect - 40 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 YKWPFHFAF 10 

++W FHFAF 
Sbjct: 3 9 FRWSFHFAF 47 

... D >gi 1 1 5 641 10 2 | ref |NP ,23 0734.. 1. [ periplasmic binding protein-related protein [Vibri 

gi | 11345643 | pir | |A82245 periplasmic binding protein-related protein VC1089 [impor 

Vibrio cholerae (strain N16961 serogroup 01) 
gi | 9655557 | gb | AAF9424 8 . 1 | periplasmic binding protein-related protein [Vibrio cho 
Length = 2 70 

Score = 26.5 bits (55), Expect = 40 
Identities = 6/9 (66%) , Positives = 8/9 (88%) 

Query: 5 PFHFAFFHQ 13 

P+H+ FFHQ 
Sbjct: 80 PYHYTFFHQ 88 

□ > gi 1 19114288 | ref [NP 593376.1] hypothetical protein [Schizosaccharomyces pombe] 



21 



3219957 



7491179 



sp | P87136 | YDM5 SCHPO Hypothetical protein C57A7.05 in chromosome I 



pir | | T38949 hypothetical protein SPAC57A7.05 - fission yeast 



(Schizosaccharomyces pombe) 
gi | 210444 0 [ emb [ CAB0 8763 . 1 \ hypothetical protein; similar to S. cerevisiae YDL231C 
contains 11 predicted transmembrane helices; 
leucine-serine rich [Schizosaccharomyces pombe] 
Length = 1337 

Score = 26.5 bits (55), Expect = 40 
Identities = 6/6 (100%), Positives = 6/6 (100%) 

Query: 3 KWPFHF 8 

KWPFHF 
Sbjct: 1308 KWPFHF 1313 
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f- 

□ > gi | 9801570 [ gb | AAF97946 . 2 | cytochrome P450 CYP6N4v6 [Aedes albopictus] 

Length =216 

Score = 26.5 bits (55), Expect = 40 

Identities = 11/21 (52%), Positives = 13/21 (61%), Gaps = 6/21 (28%) 

Query: 8 FAFFHQILA TSTIAMSHC 2 5 

F FF LA TS+ AMS+C 
Sbjct: 17 FVFF LAGFETSSTAMSYC 34 

□ > gi | 9801564 | gb | AAF97940 . 2 [ cytochrome P45 0 CYP6N4vl [Aedes albopictus] 

Length = 43 5 

Score = 26.5 bits (55), Expect = 40 

Identities = 11/21 (52%), Positives = 13/21 (61%), Gaps = 6/21 (28%) 

Query: 8 FAFFHQILA TSTIAMSHC 25 

F FF LA TS + AMS+C 
Sbjct: 2 99 FVFF LAGFETSSTAMSYC 316 

□> gi | 729930 1 sp[ P16270 |LECN PEA NONSEED LECTIN PRECURSOR 
gi |3 09675 | gb | AAA33 675 . 1 1 lectin 
Length = 2 65 

Score =26.1 bits (54), Expect = 53 
Identities = 8/9 (88%), Positives = 9/9 (100%) 

Query: 16 ATSTIAMSH 24 

ATSTIA+SH 
Sbjct: 235 ATSTIAVSH 243 

□ > gi 1 30260580 |ref | NP 842957. l| hypothetical protein [Bacillus anthracis str. Ames 
gi | 3 0253 94 8 |.gb | AAP24443 . 1 j hypothetical protein [Bacillus anthracis str. Ames] 

Length =92 0 

Score = 26.1 bits (54), Expect = 53 
Identities = 7/8 (87%), Positives = 8/8 (100%) 

Query: 11 FHQILATS 18 

FHQI+ATS 
Sbjct: 43 9 FHQIMATS 4 46 

□ > gi 1 222 934 95 | emb [ CAD31853 . 1 1 putative chemosensory receptor 7 [Heliothis viresce 

Length =3 97 

Score = 26.1 bits (54), Expect = 53 

Identities = 10/17 (58%), Positives = 11/17 (64%), Gaps = 5/17 (29%) 

Query: 1 PY- -K-WPFHFAFFHQI 14 

PY K WPF A+ HQI 
Sbjct: 174 PYELKYWPF- -AYIHQI 188 

□ > gi [26987557 1 ref | NP 742982. l| conserved hypothetical protein [Pseudomonas putida 



gi | 24982231 | gb | AAN66446 . 1 1 AE016274 3 conserved hypothetical protein [Pseudomonas 



Length = 136 
Score = 26.1 bits (54), Expect = 53 

Identities = 12/30 (40%), Positives = 14/30 (46%), Gaps = 15/30 (50%) 

Query: 16 ATSTIAMSHC RPTVYKQ 32 

A+ST+AMS C R VYKQ 

Sbjct: 42 ASSTVAMSACIGAETQVQDQRLNR- -VYKQ 69 
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□ > gil30023459|ref |NP 835090. l| Beta- 1 , 3 -N-acetylglucosaminyltransf erase [Bacillus 

14579] 

gi [2 98 9902 0 | gb | AAP122 91 . 1 | Beta- 1 , 3 -N-acetylglucosaminyltransf erase [Bacillus cer 
14579] 
Length = 32 6 

Score = 26.1 bits (54), Expect = 53 

Identities = 12/24 (50%) , Positives = 15/24 (62%) , Gaps = 8/24 (33%) 

Query: 9 AFF-HQILA-TST IAMSH 24 

AF HQ++A TST IAM+H 
Sbjct: 148 AFLTHQ WAQTS TW I FKRS I AMNH 171 

□ > gi 1 17230565 1 ref |NP 487113. l| probable glycosyl transferase [Nostoc sp. PCC 7120 
gi | 25530026 |pir | |AB2190 hypothetical protein alr3073 [imported] - Nostoc sp. (str 

7120) 

gi | 17132167 | dbj [BAB74772 . 1 | 0RF_ID : alr3 073 -probable glycosyl transferase [Nostoc 
Length = 324 

Score = 26.1 bits (54), Expect = 53 

Identities = 9/18 (50%), Positives = 9/18 (50%), Gaps = 8/18 (44%) 

Query: 2 YKWPF HFAFFHQIL 15 

YKWPF HQIL 
Sbjct: 185 YKWPFVWKQ HQIL 198 

□ > gi|27500354|ref |XP 211309. l| hypothetical protein XP_2113 09 [Homo sapiens] 

Length = 132 

Score = 26.1 bits (54), Expect = 53 
Identities = 7/8 (87%), Positives = 8/8 (100%) 

Query: 11 FHQILATS 18 

FHQIL+TS 
Sbjct: 61 FHQILSTS 68 

□ > gi 1 13812138 | ref |NP 113265. l| chromosomal region maintenance protein CRM1 [Guill 
gi | 25396709lpir | |F90086 chromosomal region maintenance protein CRM1 [imported] - 

theta nucleomorph 

gi | 1379445 0 |gb|AAK3 982 5 . 1 |AF165818 3 3 chromosomal region maintenance protein CRM1 
Length =94 9 

Score = 26.1 bits (54), Expect = 53 

Identities = 10/15 (66%) , Positives = 11/15 (73%) , Gaps = 1/15 (6%) 

Query: 10 FFHQILAT-STIAMS 23 

FFH+IL STI MS 
Sbjct: 358 FFHKILINLSTILMS 372 

□ > gi |27706584 |ref |XP 242294. l| hypothetical protein XP_2422 94 [Rattus norvegicus 

Length =415 

Score = 26.1 bits (54), Expect = 53 

Identities = 8/11 (72%), Positives = 10/11 (90%), Gaps = l/ll (9%) 

Query: 17 TSTIAMSHCRP 2 7 

TS +AMS+CRP 
Sbjct: 338 TS-VAMSNCRP 34 7 

□ > gi 1 100052 |pir | [S20988 lectin - garden pea 
gi | 169172 |gb|AAA33 691. 1 [ vegetative lectin 

Length = 265 

Score =26.1 bits (54), Expect = 53 
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Identities = 8/9 (88%), Positives = 9/9 (100%) 

Query: 16 ATSTIAMSH 24 

ATSTIA+SH 
Sbjct: 235 ATSTIAVSH 243 

D> gi 1 7510844 |pir| |T29765 hypothetical protein ZC581.4 - Caenorhabditis elegans 
Length = 209 

Score = 26.1 bits (54), Expect = 53 

Identities = 9/15 (60%), Positives = 9/15 (60%), Gaps = 4/15 (26%) 

Query: 5 PFHF- -AFFHQ- - IL 15 

PFHF FF Q IL 
Sbjct: 57 PFHFPQTFFQQPHIL 71 

□> gi | 21398366 | ref |NP 654351. l| hypothetical protein predicted by GeneMark [Bacill 
A2012] 
Length =92 0 

Score = 26.1 bits (54), Expect = 53 
Identities = 7/8 (87%), Positives = 8/8 (100%) 

Query: 11 FHQILATS 18 

FHQI+ATS 
Sbjct: 43 9 FHQIMATS 446 

□ > gi | 557942 8 |gb| AAD45549. 1 [U703 76 14 SpcE [Streptomyces netropsis] 

Length = 427 

Score =26.1 bits (54), Expect = 53 

Identities = 10/15 (66%) , Positives = 11/15 (73%) , Gaps = 3/15 (20%) 

Query: 6 FH- -FAFFHQ-ILAT 17 

FH FAF HQ +LAT 
Sbjct: 3 91 FHTCFAFLHQALLAT 4 05 

D> gi | 630666 |pir | |S40748 hypothetical protein F54C8.6 - Caenorhabditis elegans 
Length = 3 09 

Score =25.7 bits (53), Expect = 72 

Identities = 11/22 (50%) , Positives = 14/22 (63%) , Gaps = 7/22 (31%) 

Query: 7 HFA--FFHQ ILAT-STIA 21 

HF FFHQ IL++ +TIA 
Sbjct: 223 HFWQFFHQIIHLILSSIATIA 244 

□ > gi | 2 8917410 |gb [EAA27114 . 1 1 hypothetical protein [Neurospora crassa] 

Length =62 0 

Score =25.7 bits (53), Expect = 72 
Identities = 6/8 (75%), Positives = 6/8 (75%) 

Query: 1 PYKWPFHF 8 

PYKW F F 
Sbjct: 3 6 PYKWTFQF 4 3 

□ > gi 1 23612355 | ref [NP 703935. l| transport in [Plasmodium falciparum 3D7] 
gi |234 98596|emblCAD50547.l| transportin [Plasmodium falciparum 3D7] 

Length = 1147 

Score = 25.7 bits (53), Expect = 72 
Identities = 7/8 (87%), Positives = 7/8 (87%) 
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21287767 



25148238 



ref 



7505805 [pir | 



27695175 



17558564 



30145805 



ref 
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emb 



hypothetical protein XP_153478. 
SNF2 family dna repair protein, 
serine (or cysteine) proteinas. 
Glia derived nexin precursor (G. 
ENSANGP00000017962 [Anopheles ga . 

Protein kinase family member (. 

hypothetical protein 
XP 223369. l| similar to low 
Predicted 
elegans 



gblEAA00088.l| 
499962.2 



NP 



T34435 



NP 506763 .1 



CAB02832 .2 | C. 



CDS, 
SRW-24 



K11H12.9 - Caen, 
density lipopro. 
putative plasma, 
protein (corr. 



17555608 



17553568 



25342678 



21245069 



24374681 



10946860 



27672343 



28871387 



20094540 



ref 



630666 | pir | | 340748" hypothetical protein F54C8 . 6 - Caenor 
NP 498145. l| Predicted CDS, putative membra 
NP 499077. i| Predicted CDS, putative membra 
[B88555 protein F54C8.6 [imported] - Caenor 
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NP 644651.1 



NP 718724.1 
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NP 794006.1 



NP 614387.1 



15078968 



30177923 



15641102 



ref 



NP 230734.1 



conserved hypothetical protein. 
DnaJ domain protein [Shewanell. 
TATA box binding protein (Tbp) . 
similar to hypothetical protei. 
hypothetical protein [Pseudomo. 
Uncharacterized membrane prote. 
9632199|ref |NP 048911. l| a555R [Paramecium bursaria Chlo. 

ref|NP 149719. l| 256R [Invertebrate iridescent . 
gb 1 EAA45104 . 1 | ENSANGP000000232 94 [Anopheles ga. 

periplasmic binding protein-re. 
hypothetical protein [Schizosa. 
CAD31853.1| putative chemosensory receptor . 

probable glycosyl transferase . 
hypothetical membrane protein . 
ZP 00028030. l| hypothetical protein [Burkho. 
NP 126924.1 | CARBON STARVATION PROTEIN A. [. 
NP 501719.1 | 3-oxo-5-alpha-steroid 4-dehydr. 
T29765 Hypothetical protein ZC581 . 4 - Caeno . 
NP 195444.1 | ATPase, E1-E2 type family [Ara . 
gb|AAD45549.l|U70376 14 SpcE [Streptomyces netro. 
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17230565 
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7510844 



15235511 ref 



5579428 
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NP 593376.1 



NP 487113.1 



NP 600312.1 



15606525 



ref 



21242454 



ref 



NP 213905.1 



NP 642036.1 



28917410 



23612355 



ref 



17297993 



30177001 



dbj 



27366492 



fib 



ref 



NP 762019.1 



15903492 



ref 



NP 359042.1 



13812138 



ref 



NP 113265.1 



13476001 



ref 



NP 107571.1 



30020491 



ref 



NP 832122.1 



emb|CAB90352.l|~ putative metal ATPase [Arabidops . 

NADH dehydrogenase I chain M [ . 

, MFS transporter [Xanthomonas a. 

gblEAA27114.l| "^hypothetical protein [Neurospora. 
NP703935 . 1 | transportin [Plasmodium falcip. 
BAB78507 . 1 | transportin [Plasmodium falcipa. 
EAA08763.2 | ENSANGP0000002 13 06 [Anopheles ga. 

Cation/multidrug efflux pump [. 
Conserved hypothetical protein, 
chromosomal region maintenance. 
ABC transporter, substrate bin. 
hypothetical protein [Bacillus, 
conserved hypothetical protein, 
retroelement pol polyprotein - . 
putative f errichrome ABC trans . 
conserved hypothetical protein. 
BAC16541 . 1 | ferric vibriof errin transport s . 

similar to olfactory receptor . 
Amino acid permease family mem. 
putative permease [Streptomyce . 
Multidrug resistance protein B. 
putative TYPE II DNA MODIFICAT. 
AAK92 685 . 1 | ACQ 907 14 18 putative retrotranspo . 

S44629 F22B7.10 protein - Caenorhabditis ele. 

16944534 | emb | CAD 11330 . 1 [ hypothetical protein [Neurospor. 
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NP 491002 . 1 



NP 630316.1 



NP 830472.1 



NP 223347.1 
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9 1 


2 6251614 |gb|AAN84 835.1 | 


Hypothetical protein C55C2.5C [C. . . 


25 


97 




9 A 


1477386 |gb|AAC37338 . 1 | This CDS feature is included to s... 


25 


97 




9i 


24666563 


ref|NP 649079.2 | CG6836-PA [Drosophila melanoga. . . 


25 


97 


D 


9 1 


17506263 


ref |NP 491003 . 1 


Amino acid permease family mem. . . 


25 


97 


D 


gi 


17861760 


gb| AAL39357 . 1 | 


GH26215p [Drosophila melanogaster] 


25 


97 


D 


gi 


17555266 


ref 


NP 497282.1 


Putative membrane protein, wit. . . 


25 


97 


D 


ai 


26336525 


dbj 


BAC31945 . 1 | 


unnamed protein product [Mus mu. . . 


25 


97 


D 




21231138 


ref 


NP 637055.1 


MFS transporter [Xanthomonas c. . . 


25 


97 






30840221 


emb 


CAD61285 . 1 | 


fertility restorer [Raphanus sa. . . 


25 


97 




ai 


17552842 


ref 


NP 498909.1 


DumPY : shorter than wild- type... 


25 


97 


D 




21398624 


ref 


NP 654609.1 


sugar tr, Sugar (and other) tr. . . 


25 


97 




gi 


28564962 


gb|AA032565.l| 


FKS1 [Saccharomyces kluyveri] 


~25 


130 




9 1 


3914053 | sp|013396 | MSH2 NEUCR DNA mismatch repair protein... 


25 


130 




91 


20837204 


ref 


XP 158536.1 


hypothetical protein XP 158536... 


25 


130 


D 


9 1 


29349732 


ref 


NP 813235.1 


conserved hypothetical protein. . . 


25 


130 




gi 


23273294 


gb|AAH34950 . l| 


TANK-binding kinase 1 [Homo sapi . . . 


25 


130 


D 


cri 


19923703 


ref 


NP 037056.2 | CD44 antigen; Cell surface gly. . . 


25 


130 


D 


91 


7019547 | ref | NP 037386. l| 


TANK-binding kinase 1; NF-kB-ac... 


Z D 




11 

■a 


9 1 


26991318 


ref 


NP 746743.1 


conserved hypothetical protein. . . 


Z O 






qi 


15791673 


ref 


NP 281496.1 


hypothetical protein Cj 0305c [... 


25 


130 




gi 


21595149 


gb|AAH32196.1 | 


Unknown (protein for MGC:38244) ... 


~~25 


130 




gi 


30022512 


ref 


NP 834143.1 


ABC transporter ATP-binding pr. . . 


25 


130 




Sf 1 


29377576 


ref 


NP 816730.1 


serine/threonine protein kinas . . . 


25 


130 




gi 


27706584 


ref 


XP 242294.1 


hypothetical protein XP 242294... 


25 


130 


D 


cii 


28521689 


ref 


XP 288504.1 


hypothetical protein XP 288504... 


25 


130 


B 


gi 


22957373 


gblZP 00005078. l| hypothetical protein [Rhodoba. . . 


~25 


130 




cri 


17543364 


ref 


NP 501353.1 


Putative nuclear protein, with. . . 


25 


130 


D 


ai 


21402483 


ref 


NP 658468.1 


ABC tran, ABC transporter [Bac . . . 


25 


130 




gi 


27368877 


emb 


CAD59596.l| 


MRP-like ABC transporter [Oryza... 


25 


130 




gi 


27718495 


ref 


XP 216902.1 


similar to TANK-binding kinase. . . 


25 


130 


D 


gi 


21397994 


ref 


NP 653979.1 


hypothetical protein predicted. . . 


24 


174 




a 1 


15800733 


ref 


NP 286747.1 


unknown [Escherichia coli 0157... 


24 


174 




9i 


17456139 


ref 


XP 051362.2 


similar to Transmembrane 6 sup... 


24 


174 


D 


gi 


84991 |pir| |A20647 heat shock protein 22 - fruit fly (Dro. . . 


24 


174 


D 


9 r 


1347338l|ref 


NP 104 9 4 8. 1 


hypothetical protein [Mesorhiz.. . 


24 


" 174 




gi 


17569347 | ref 


NP 510111.1 


Putative endoplasmic reticulum. . . 


24 


174 


D 


Qi 


27666390 


ref 


XP 221376.1 


similar to hypothetical protei . . . 


24 


174 


D 


gi 


631889 |pir | | C48213 syntaxin 2 - rat 


24 


174 




gi 


27707330 


ref 


XP 242328.1 


hypothetical protein XP_242328... 


24 


174 


H 


gi 


15830651 


ref 


NP 309424.1 


hypothetical protein [Escheric... 


24 


174 




gi 


13124733 


sp|P02515 |HS22 DROME Heat shock protein 22 


24 


174 





Alignments 



Get selected sequences 



Select all 



Deselect all 



□ > gi|20909630lref |XP 153478. l[ hypothetical protein XP_153478 [Mus musculus] 

Length = 197 

Score = 2 9.9 bits (63), Expect =3.8 
Identities = 7/7 (100%), Positives = 7/7 (100%) 



Query: 4 WPFHFAF 10 

WPFHFAF 
Sbjct: 12 WPFHFAF 18 

□ > gi [ 19113394 I ref |NP 596602. l| SNF2 family dna repair protein by similarity [Schi 
pombe] 

gi I 7492644 I pir I |T39936 probable helicase - fission yeast (Schizosaccharomyces pom 
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gi | 311612 0 [ emb | CAA1887 0 . 1 | zinc finger protein; zf-C3HC4 type (RING finger); SNF2 
helicase; DEAD box helicase; putative DNA repair protein 
[Schizosaccharomyces pombe] 
Length = 104 0 

Score = 29.9 bits (63), Expect =3.8 

Identities = 11/16 (68%) , Positives = 12/16 (75%) , Gaps = 1/16 (6%) 



Query: 13 QILATIAMSHCRPTVY 2 8 

+ IL I A SH RPTVY 
Sbjct: 458 EILTKIAPSH- RPTVY 472 

D > gi | 27685447 1 ref [XP 217453. l| serine (or cysteine) proteinase inhibitor, clade 

[Rattus norvegicus] 
Length = 3 98 



Score = 29.5 bits (62), Expect = 5.1 

Identities = 12/27 (44%) , Positives = 15/27 (55%) , Gaps = 8/27 (29%) 



Query: 4 WPFHFAFFHQILATIAMSHCRPTVYKQ 3 0 

W HF FF IL T+ +S +VY Q 

Sbjct: 3 W- -HFPFF- -ILTTVTLS SVYSQ 21 



□ > gi 1 121111 1 sp | P07092 | GDN RAT Glia derived nexin precursor (GDN) (Protease nexin 



gi | 92273 |pir 1 |B27496 proteinase inhibitor nexin 1 precursor - rat (fragment) 



gi | 2 042 84 [gb| AAA412 09 . 1 | GDN precursor 

Length =3 97 

Score = 29.5 bits (62), Expect = 5.1 

Identities = 12/27 (44%) , Positives = 15/27 (55%) , Gaps = 8/27 (29%) 



Query: 4 WPFHFAFFHQILATIAMSHCRPTVYKQ 3 0 

W HF FF IL T+ +S +VY Q 

Sbjct: 3 W- -HFPFF- -ILTTVTLS SVYSQ 21 



. Q> gi | 212 8.7767j^g,bj EM.00088..1 1. ENSANGP00000017962 [Anopheles gambiae str. PEST] 
Length =314 

Score =29.1 bits (61), Expect = 6.9 

Identities = 19/47 (40%) , Positives = 22/47 (46%) , Gaps = 20/47 (42%) 

Query: 4 WPFHFAFFH Q 1 LA- - TI - -AMSHCRPTVYKQTSII 34 

WP+H A FH Q IL TI A SHC VY Q+ +1 
Sbjct: 52 WPWHVALFHRKDAQYEYACGGS ILDENTILTA- SHC VYTQSGVI 94 

□ > gi | 25148238 | ref |NP 499962. 2| Protein kinase family member (42.4 kD) [Caenorhabd 
gi | 1797653 0 |gb[AAC244 03 .2 | Hypothetical protein K11H12.9 [Caenorhabditis elegans] 
Length =3 76 



Score = 28.6 bits (60), Expect = 9.2 
Identities = 8/11 (72%) , Positives = 9/11 (81%) 



Query: 12 HQILATIAMSH 22 

HQ I LA +AM H 
Sbjct: 203 HQ I L AAVAMMH 213 

□ > gi | 7505805 |pir | |T34435 hypothetical protein K11H12.9 - Caenorhabditis elegans 
Length =3 74 

Score = 28.6 bits (60), Expect = 9.2 
Identities = 8/11 (72%) , Positives = 9/11 (81%) 

Query: 12 HQILATIAMSH 22 
HQ I LA +AM H 
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Sbjct: 201 HQ I L AAVAMMH 211 

D > gi | 27695175 | ref [XP 223369. l| similar to low density lipoprotein receptor-relat 

low density lipoprotein-related protein 4; Low Density 
Lipoprotein Receptor Related Protein 4; corin [Mus 
musculus] [Rattus norvegicus] 
Length - 276 

Score = 28.2 bits (59), Expect = 12 
Identities = 7/10 (70%), Positives = 9/10 (90%) 

Query: 6 FHFAFFHQIL 15 

FHF+ FHQ+L 
Sbjct: 2 FHFSLFHQVL 11 

D > gi | 17558564 1 ref |NP 506763. l| Predicted CDS, putative plasma membrane membrane 

member, with at least 5 transmembrane domains, nematode 
specific [Caenorhabditis elegans] 
gi | 7497294 [pir[ |T19886 hypothetical protein C41G6.7 - Caenorhabditis elegans 
Length =4 05 

Score =28.2 bits (59), Expect = 12 

Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 4/23 (17%) 

Query: 13 Q ILATI AMS - HCRPTVYKQTS I I 34 

IL +1 MS HCR TV K SI+ 
Sbjct: 350 HILVSIMMSSHCRDTV-K- -SIV 369 

D > gi 1 3 0145805 | emb | CAB02 832 . 2 | C. elegans SRW-24 protein (corresponding sequence C 
[Caenorhabditis elegans] 
Length =418 

Score = 28.2 bits (59), Expect = 12 

Identities = 13/23 (56%), Positives = 15/23 (65%), Gaps = 4/23 (17%) 

Query: 13 Q ILATI AMS -HCRPTVYKQTS I I 34 

IL +1 MS HCR TV K SI + 
Sbjct: 363 HILVSIMMSSHCRDTV-K- -SIV 3 82 

□ > gi | 630666 |pir | |S40748 hypothetical protein F54C8 . 6 - Caenorhabditis elegans 
Length = 3 09 

Score =27.8 bits (58), Expect = 17 

Identities = 11/22 (50%), Positives = 12/22 (54%), Gaps = 9/22 (40%) 

Query: 7 HFA--FFHQI LATIA 19 

HF FFHQI +ATIA 
Sbjct: 223 HFWQFFHQIIHLILSSIATIA 244 

□> gi | 17555608 | ref |NP 498145. l| Predicted CDS, putative membrane protein family m 

least 4 transmembrane domains, nematode specific 
[Caenorhabditis elegans] 
gi | 73 31800 [gb | AAF60488 . 1 | Hypothetical protein Y32H12A.1 [Caenorhabditis elegans] 
Length =310 

Score = 27.8 bits (58), Expect = 17 
Identities = 8/11 (72%) , Positives = 9/11 (81%) 

Query: 11 FHQ I LAT I AMS 21 

FHQILA + MS 
Sbjct: 47 FHQILAIVLMS 57 

□> gi | 17553568 | ref |NP 499077. l| Predicted CDS, putative membrane protein, with a 

domain, nematode specific [Caenorhabditis elegans] 
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2i 



2i 



20141894 



15718117 



sp | P34444 | YL86 CAE EL Hypothetical protein F54C8.6 in chromosome III 



emb 1 CAA80158 . 2\ Hypothetical protein F54C8.6 [Caenorhabditis elegans] 



Length - 265 
Score = 27.8 bits (58), Expect = 17 

Identities = 11/22 (50%) , Positives = 12/22 (54%) , Gaps = 9/22 (40%) 

Query: 7 HFA--FFHQI LATIA 19 

HF FFHQI +ATIA 
Sbjct: 223 HFWQFFHQI IHLILSS IATIA 244 

D> gi | 25342678 |pir | [B88555 protein F54C8.6 [imported] - Caenorhabditis elegans 
Length =3 06 

Score = 27.8 bits (58), Expect = 17 

Identities = 11/22 (50%) , Positives = 12/22 (54%) , Gaps = 9/22 (40%) 

Query: 7 HFA- - FFHQI LATIA 19 

HF FFHQI +ATIA 
Sbjct: 264 HFWQFFHQI IHLILSS I AT I A 285 

D> gi|21245069lref |NP 644651. l| conserved hypothetical protein [Xanthomonas axonop 
str. 306] 

gi | 21110 8 02 [ gb | AAM3 9187 . 1 | conserved hypothetical protein [Xanthomonas axonopodis 
str. 306] 
Length =12 9 

Score = 27.4 bits (57), Expect = 22 
Identities = 7/9 (77%), Positives = 7/9 (77%) 

Query: 3 KWPFHFAFF 11 

KW F FAFF 
Sbjct: 12 KWQFRFAFF 2 0 

□ > gi | 24374681 1 ref [NP 718724. l| DnaJ domain protein [Shewanella oneidensis MR-1] 
gi | 2434932 6 [gb | AAN56168 . 1 | AEO 15754 2 . DnaJ domain protein [Shewanella oneidensis M 

Length =4 02 

Score = 26.9 bits (56), Expect = 30 

Identities = 9/11 (81%), Positives = 9/11 (81%), Gaps = 1/11 (9%) 

Query: 6 FHFAFFHQILA 16 

FH AFF QILA 
Sbjct: 170 FH-AFFEQILA 179 

□ > gi | 10946860 | ref [NP 067441. i| TATA box binding protein (Tbp) -associated factor, 

A [Mus musculus] 
gi | 18422 04 | emb | CAA71091 . 1 [ TAFI48 protein [Mus musculus] 

Length = 453 

Score = 26.9 bits (56), Expect = 30 

Identities = 7/9 (77%), Positives = 8/9 (88%), Gaps = 1/9 (11%) 

Query: 4 WP-FHFAFF 11 

WP FHF+FF 
Sbjct: 380 WPAFHFSFF 388 

□ > gi 1 27672343 | ref [XP 221755. l| similar to hypothetical protein MGC2408 [Homo sap 

norvegicus] 
Length = 2 56 

Score =26.9 bits (56), Expect = 30 

Identities = 10/16 (62%) , Positives = 12/16 (75%) , Gaps = 2/16 (12%) 
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Query: 1 PYKWPF-HFAFFHQIL 15 

PY+W F FAFFH +L 
Sbjct: 130 PYEW - FGDFAFFHALL 144 

D > gi [ 28871387 1 ref | NP 794006. l| hypothetical protein [Pseudomonas syringae pv. torn 
gi|28854638|gb | AAO57701 . 1 | hypothetical protein [Pseudomonas syringae pv. tomato 
Length = 1111 

Score =26.9 bits (56), Expect = 30 
Identities = 7/8 (87%), Positives = 8/8 (100%) 

Query: 8 FAFFHQIL 15 

FAFFHQ+L 
Sbjct: 72 8 FAFFHQLL 73 5 

n > gi | 20094540 | ref 1 NP 614387. l| Uncharacterized membrane protein specific for M.ka 

family [Methanopyrus kandleri AV19] 
gi|l9887659[gb| AAM02317 . 1 | Uncharacterized membrane protein specific for M.kandle 
family [Methanopyrus kandleri AV19] 
Length = 244 

Score =26.9 bits (56), Expect = 30 

Identities = 10/22 (45%), Positives = 11/22 (50%), Gaps = 8/22 (36%) 

Query: 4 WPFHFAFFHQILA TIAM 2 0 

W FAFF Q+L TI M 

Sbjct: 2 7 W FAFFPQVLTFLGWLTIVM 4 5 

□ > gi 1 9632199 | ref |NP 048911. l| a555R [Paramecium bursaria Chlorella virus 1] 
2i 



21 



7461690 



2447141 



pir[ 1 T18057 hypothetical protein a555R - Chlorella virus PBCV-1 



gb | AAC96999 . 1 | a555R [Paramecium bursaria Chlorella virus 1] 



Length = 113 
Score =26.9 bits (56), Expect = 30 

Identities = 10/16 (62%), Positives = 10/16 (62%), Gaps = 4/16 (25%) 

Query: 14 ILATIA- -MSHC- -RP 25 

IL TI MSHC RP 
Sbjct: 60 ILETIVFIMSHCYKRP 75 

□ > gi 1 15078968 | ref [NP 149719. l[ 256R [Invertebrate iridescent virus 6] 
gi 1 1504233 7 jgb | AAK82 117 . 1 [AF3 03 741 256 256R [Chilo iridescent virus] 

Length =78 

Score =26.5 bits (55), Expect = 40 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 2 YKWPFHFAF 10 

++W FHFAF 
Sbjct: 3 9 FRWS FHFAF 47 

□ > gi 1 3 0177 92 3 [gb|EAA45104 . l| ENSANGP000 000232 94 [Anopheles gambiae str. PEST] 

Length = 232 

Score = 26.5 bits (55), Expect = 40 

Identities = 15/36 (41%) , Positives = 16/36 (44%) , Gaps = 17/36 (47%) 

Query: 4 WPFHFAFFHQ - - - IL- -ATI - -AMSHC 23 

WP+H A FHQ IL TI A SHC 

Sbjct: 1 WPWHAAIFHQKDKHKEYACGGSILDETTILTA-SHC 35 

□> gi | 15641102 | ref |NP 230734. l| periplasmic binding protein-related protein [Vibri 
gi | 11345643 |pir[ |A82245 periplasmic binding protein-related protein VC1089 [impor 
Vibrio cholerae (strain N16961 serogroup 01) 
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gi|9655557|gb| AAF9424 8 . 1 | periplasmic binding protein-related protein [Vibrio cho 
Length = 2 70 

Score = 26.5 bits (55), Expect = 40 
Identities = 6/9 (66%), Positives = 8/9 (88%) 

Query: 5 PFHFAFFHQ 13 

P+H+ FFHQ 
Sbjct: 8 0 PYHYTFFHQ 8 8 



□ > gi 1 19114288 | ref |NP 593376. l| hypothetical protein [Schizosaccharomyces pombe] 



a 1 


3219957 


sp|P87136 


YDM5 SCHPO Hypothetical protein C57A7.05 in chromosome I 




7491179 


pir| |T38949 hypothetical protein SPAC57A7.05 - fission yeast 



(Schizosaccharomyces pombe) 



gi|2104440l emb | CAB08763 . 1 | hypothetical protein; similar to S. cerevisiae YDL231C 
contains 11 predicted transmembrane helices; 
leucine-serine rich [Schizosaccharomyces pombe] 
Length = 133 7 

Score = 26.5 bits (55), Expect = 40 
Identities = 6/6 (100%) , Positives = 6/6 (100%) 

Query: 3 KWPFHF 8 

KWPFHF 
Sbjct: 1308 KWPFHF 1313 

D > gi | 222 934 95 | emb | CAD31853 . 1 1 putative chemosensory receptor 7 [Heliothis viresce 
Length = 397 

Score = 26.1 bits (54), Expect - 54 

Identities = 10/17 (58%), Positives = 11/17 (64%), Gaps = 5/17 (29%) 

Query: 1 PY- -K-WPFHFAFFHQI 14 

PY K WPF A+ HQI 
Sbjct: 174 PYELKYWPF- -AY I HQ I 188 

TX> giT 17230565 | ref |NP 487113. l| probable glycosyl transferase [Nostoc sp . PCC 712 0 
gi | 25530026 |pir 1 |AB2190 hypothetical protein alr3073 [imported] - Nostoc sp. (str 
7120) 

gi | 17132167 | dbj [BAB74 772 . 1 | ORF_ID : alr3 073 -probable glycosyl transferase [Nostoc 
Length = 324 

Score = 26.1 bits (54), Expect = 54 

Identities = 9/18 (50%) , Positives = 9/18 (50%) , Gaps = 8/18 (44%) 

Query: 2 YKWPF HFAFFHQIL 15 

YKWPF HQIL 
Sbjct: 185 YKWPFVWKQ HQIL 198 

D > gi 1 19552310 | ref | NP 600312 . 1 1 hypothetical membrane protein [Corynebacterium glu 
13032] 

gi [ 2132 3 851 1 dbj | BAB984 77 . 1 | Hypothetical membrane protein [Corynebacterium glutam 
13032] 

Length =241 
Score = 26.1 bits (54), Expect = 54 

Identities = 10/19 (52%), Positives = 13/19 (68%), Gaps = 4/19 (21%) 

Query: 4 WPFHF- -AFFHQILATIAM 20 

WPFHF A +I+A IA+ 
Sbjct: 71 WPFHFSDAL- -RIIAAIAL 87 

□ > gi 1 22982800 | ref | ZP 00028030.1] hypothetical protein [Burkholderia fungorum] 
Length =294 
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Score = 26.1 bits (54), Expect = 54 

Identities = 10/16 (62%) , Positives = 12/16 (75%) , Gaps = 3/16 (18%) 



Query: 13 QILATIAMSHCR- - PT 2 6 

+ILATI +HCR PT 
Sbjct: 51 E I LAT I - RTHCRTVPT 65 



□ > gi 1 14521448 1 ref|NP 126924. l| CARBON STARVATION PROTEIN A. [Pyrococcus abyssi] 
gi | 7444677 | pir | [E75032 carbon starvation protein A homolog PAB1554 [similarity] - 

Pyrococcus abyssi (strain Orsay) 
gi | 5458667 | emb | CAB50154 . 1 [ cstA carbon starvation protein A [Pyrococcus abyssi] 
Length =5 92 

Score = 26.1 bits (54), Expect = 54 

Identities = 10/19 (52%), Positives = 13/19 (68%), Gaps = 5/19 (26%) 

Query: 4 WPFHFAF- -FHQILATIAM 20 

WP AF +Q+LA+IAM 
Sbjct: 493 WP AFSGMNQMLAS I AM 508 

D > gi | 17544706 | ref 1 NP 501719. l[ 3 -oxo-5 -alpha- steroid 4 -dehydrogenase family memb 

elegans] 



9 1 


7510956 


pir 


|T27720 hypothetical protein ZK1251.3 - Caenorhabditis elegans 


9 i 


3881515 


emb 


CAA92499.l| Hypothetical protein ZK1251.3 [Caenorhabditis elegans] 



Length = 2 53 



Score = 26.1 bits (54), Expect = 54 

Identities = 11/18 (61%), Positives = 11/18 (61%), Gaps = 5/18 (27%) 



Query: 8 FAFFHQI LATIAMSH 22 

FAFF I LA AMSH 
Sbjct: 214 FAFF - - I VCNLAPRAMSH 229 



D > gi | 7510844 |pir 1 |T29765 hypothetical protein ZC581.4 - Caenorhabditis elegans 
Length = 209 

Score = 26.1 bits (54), Expect = 54 

Identities = 9/15 (60%), Positives = 9/15 (60%), Gaps = 4/15 (26%) 



Query: 5 PFHF- -AFFHQ- -IL 15 

PFHF FF Q IL 
Sbjct: 57 PFHFPQTFFQQPHIL 71 



□ > gi 1 15235511 1 ref |NP 195444 . 1 1 



2i 



12643808 



25407802 



ATPase, E1-E2 type family [Arabidopsis thaliana] 

sp | Q9M3H5 | AHM1 ARATH Potential cadmium/ zinc -transporting ATPase HMA1 



pir 1 | D85440 Cu2 + - transport ing ATPase -like protein [imported] - Arabid 



thaliana 



9 1 


4006855 


emb 


CAB16773 


1 


9 1 


7270710 


emb 


CAB80393 


1 



Cu2+-transporting ATPase-like protein [Arabidopsis tha 
Cu2+-transporting ATPase-like protein [Arabidopsis tha 



Length 



819 



Score = 26.1 bits (54), Expect = 54 

Identities = 11/27 (40%), Positives = 14/27 (51%), Gaps = 11/27 (40%) 



Query: 2 YKWPFHFAFFHQILATIAMSHCRPTVY 2 8 

+KWPF L+T A CR +VY 
Sbjct: 385 FKWPF LSTAA CRGSVY 400 



□ > gi | 557942 8 | gb | AAD4554 9 . 1 |U703 76 14 SpcE [Streptomyces netropsis] 
Length = 427 

Score = 26.1 bits (54), Expect = 54 

Identities = 10/15 (66%), Positives = 11/15 (73%), Gaps = 3/15 (20%) 
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RID= 1 053457234-04 1 30- 1 70 1 0, 
» 



http://www.ncbi.nlm.nih.gov/blast/Blast.cgi 



Query: 6 FH - - FAF FHQ - I LAT 17 

FH FAF HQ +LAT 
Sbjct: 391 FHTC FAFLHQALLAT 405 

D > gi | 7710954 [ emb | CAB903 52 . 1 1 putative metal ATPase [Arabidopsis thaliana] 
Length = 819 

Score = 26.1 bits (54), Expect = 54 

Identities = 11/27 (40%) , Positives = 14/27 (51%) , Gaps = 11/27 (40%) 

Query: 2 YKWPFHFAFFHQILATIAMSHCRPTVY 2 8 

+KWPF L+T A CR +VY 
Sbjct: 385 FKWPF LSTAA CRGSVY 400 

D> gi 1 15606525 1 ref |NP 213905. l| NADH dehydrogenase I chain M [Aquifex aeolicus] 
gi | 7432313 |pir | [B70414 NADH2 dehydrogenase (ubiquinone) (EC 1.6.5.3) I chain nuoM 
Aquifex aeolicus 

gi|2983743 |gbj AAC073 04 . 1 | NADH dehydrogenase I chain M [Aquifex aeolicus VF5] 
Length = 491 

Score = 26.1 bits (54), Expect = 54 

Identities = 11/19 (57%), Positives = 12/19 (63%), Gaps = 3/19 (15%) 

Query: 1 PYKWPFHFAFFHQILATIA 19 

PY F FAFF +LA IA 
Sbjct: 370 P Y FTFAFFLAVLAG I A 385 



Score = 19.7 bits (39), Expect = 4415 
Identities = 4/4 (100%) , Positives = 4/4 (100%) 

Query: 4 WPFH 7 
WPFH 

Sbjct: 226 WPFH 229 

Q > gi | 21242454 | ref | NP 642036 . 1 1 MFS transporter [Xanthomonas axonopodis pv. citri 
gi|21107899l gb [ AAM3 6572 . 1 | MFS transporter [Xanthomonas axonopodis pv. citri str. 
Length = 474 

Score = 25.7 bits (53), Expect = 72 

Identities = 10/17 (58%), Positives = 12/17 (70%), Gaps = 2/17 (11%) 

Query: 6 FHFAFFHQILA- -TIAM 2 0 

F FAFF Q LA T+A+ 
Sbjct: 167 FAFAFFVQFLAVPTVAL 183 

D > gi | 2 8917410 | gb | EAA27114 . 1 1 hypothetical protein [Neurospora crassa] 
Length =62 0 

Score = 25.7 bits (53), Expect = 72 
Identities = 6/8 (75%) , Positives = 6/8 (75%) 

Query: 1 PYKWPFHF 8 

PYKW F F 
Sbjct: 3 6 PYKWTFQF 43 

□ > gi|23612355|ref [NP 703935. l| transportin [Plasmodium falciparum 3D7] 
gi ] 2 34 98596 | emb | CADS 054 7 . 1 | transportin [Plasmodium falciparum 3D7] 
Length = 1147 

Score = 25.7 bits (53), Expect = 72 
Identities = 7/8 (87%) , Positives = 7/8 (87%) 



lOof 13 



5/20/03 3:01 PM 



